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DETAILED ACTION 
Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

1. Claims 1-5, 13, 16, 19 and 20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Julliard (European Patent Application EP 0 886 226 A1), hereinafter 
referred to as Julliard in view of Voorhees ("Using WordNet for Text Retrieval" in 
WordNetAn Electronic Lexical Database edited by C. Fellbaum), hereinafter referred to 
as Voorhees. 

Regarding claim 1 , Julliard discloses a linguistic search system and includes the 
following methods: analyzing a text database to determine whether there is a match 
between a query and a portion of said text database where the match is done on a 
tagged form of the database (col. 1, lines 40-44, col. 5, lines 7-57), which corresponds 
to "analyzing said natural language text corpus with respect to surface structure for 
word tokens and surface syntactic roles of constituents"; indexing the text corpus (col. 5, 
lines 7-44), which corresponds to "indexing and storing the analyzed natural language 
text corpus"; analyzing the natural language query (col. 1, lines 31-40), which 
corresponds to "analyzing a natural language query with respect to surface structure of 
word tokens and surface syntactic roles of constituents"; converting the natural 
language expression to a tagged form including one or more words with variations 



Application/Control Number: 09/599,563 Page 3 

Art Unit: 2654 

having the same initial meaning, and for each word a part-of-speech tag (col. 1, lines 
35-40, col. 4, lines 8-14), which corresponds to "creating a number of surface variants of 
the analyzed natural language query . . ., and for at least one surface variant by 
rearranging word tokens of said natural language query, in such a way that said number 
of surface variants are equivalent to said natural language query with respect to lexical 
meaning of word tokens and surface syntactic roles of constituents"; 

But Julliard does not specifically teach "replacing word tokens of said natural 
language query." However, the examiner contends that this concept was well known in 
the art, as taught by Voorhees. 

In the same field of endeavor, Voorhees teaches the use of WordNet for 
information retrieval where a natural language query can be expanded using different 
words to express the same concept (p. 295, 1J2). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard by specifically providing the features 
as, as taught by Voorhees, since this technique allows an improved ability to express 
the relevant concept during search (p. 295, P). 

In addition, Julliard teaches analyzing the text database to determine whether 
there is a match with the natural language expression that includes variations (col. 1, 
lines 41-45, Fig. 2, itesm s4 and s5, col. 4, lines 8-14, col. 5, lines 7-58), which 
corresponds to "comparing said number of surface variants and said analyzed natural 
language query with the indexed and stored analyzed natural language text corpus"; 
and determining the location of a match with said regular expression or index (col. 1 , 
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line 57 through col. 2, line 1 , Fig. 1 . s6, col. 5, lines 6-44), which corresponds to 
"extracting from said indexed and stored analyzed natural language text corpus, each 
portion of text comprising a string of word tokens that matches any one of said surface 
variants or said analyzed natural language query." 

Regarding claim 2, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1); in addition, Julliard teaches the use of a part-of-speech 
tagger to determine a simplified part-of speech of each word (e.g., NOUN, VERB, ADJ) 
and the generation of rules based on a grammar (col. 3, lines 39-55, col. 4, lines 8-55, 
Fig. 2 50 51 52), which corresponds to "in the step of creating, said surface syntactic 
roles of constituents are head and modifier roles, and grammatical relations." 

Regarding claim 3, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1); in addition, Julliard teaches that a regular expression, 
including word tokens tagged for part-of-speech, is matched against the corpus using 
an indexing scheme to pinpoint text (Fig. 2 s4 52 s5 s6, col. 5, lines 7-58), which 
corresponds to "in the step of extracting, a string of word tokens in said indexed and 
stored analyzed natural language text corpus matches one of said surface variants or 
said analyzed natural language query if it comprises the head words of phrases bearing 
the grammatical relations of subject, object, and lexical main verb in said one of said 
surface variants or said analyzed natural language query in the same linear order as in 
said one of said surface variants or said analyzed natural language query." 

Regarding claim 4, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1). In addition, Julliard teaches that the text corpus and the 
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query are both tagged (col. 1 , lines 45-56, col. 5, lines 6-25, note (1 )), which 
corresponds to "in the step of analyzing a natural language query, said natural language 
query is analyzed in the same manner as said natural language text corpus is analyzed 
in the step of analyzing said natural, language text corpus." 

Regarding claim 5, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1). In addition, Julliard teaches the following: generating a 
part-of-speech tag that includes a morphological feature marker where both the query 
and the text under go part-of-speech tagging (col. 1, lines 45-53, col. 5, note (1)), which 
corresponds to "determining a morpho-syntactic description for each word token of said 
natural language text corpus; and determining a morpho-syntactic description for each 
word token of said natural language query"; generating regular expression 
representations that can represent phrases and locating the regular expressions within 
the database where a regular expression inherently determines the type of the phrase 
(col. 1, line 45, col. 2, line 1, col. 5, lines 6-19), which corresponds to "locating phrases 
in said natural language text corpus; determining a phrase type for each of said 
phrases"; locating the position of sentences within the database (col. 5, lines 37-39), 
which corresponds to "locating clauses in said natural language text corpus." In 
addition, during the analysis of the query, Julliard teaches the application of grammar 
rules to one or more words (i.e., word, phrase or clause) deriving a regular expression 
representation where the regular expression inherently determines the type of the 
phrase (or clause) (col. 1, lines 38-41), which corresponds to "locating phrases in said 
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natural language query; determining a phrase type for each of said phrases; and 
locating clauses in said natural language query." 

Regarding claim 13, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1). In addition, Julliard teaches that a regular expression is 
matched with the corpus (Fig. 2 s5) or an indexing mechanism is used to locate a 
sentence or file (document) and that the matched expression (or indexed entity) is 
pinpointed in the original text (Fig. 2 s6), which corresponds to "in the step of extracting, 
a portion of text that is extracted is either the matching string of word tokens, a clause 
comprising the matching string of word tokens, a sentence comprising the matching 
string of word tokens, a paragraph comprising the matching string of word tokens, or a 
document comprising the matching string of word tokens." 

Regarding claim 16, Julliard in view of Voorhees teaches a system for linguistic 
search with the following features: a part-of-speech tagger that analyzes both the text 
corpus and the query (col. 1 , lines 45-55, col. 5, line 6-18), which corresponds to "a text 
analysis unit for analyzing a natural language text corpus and a natural language query 
with respect to surface structure of word tokens and surface syntactic roles of 
constituents"; storage means for storing the tagged text corpus (Fig. 1 38), which 
corresponds to "storage means operatively connected to said text analysis unit, for 
storing the analyzed natural language text corpus"; an indexer inherently connected to 
the storage means (Fig. 1, col. 5, lines 19-44), which corresponds to "an indexer, 
operatively connected to said storage means, for indexing the analyzed natural 
language text corpus; an index, operatively connected to said indexer, for storing said 



Application/Control Number: 09/599,563 Page 7 

Art Unit: 2654 

indexed analyzed natural language text corpus"; the ability to process the query that 
includes tagging and the generation of variations (Fig. 2, s1 s2 s3 s4, col. 1, lines 45- 
56, Fig. 1, col. 4, lines 8-14), which corresponds to "a query manager, operatively 
connected to said text analysis unit, comprising means for creating surface variants of 
said natural language query by . . . rearranging word tokens of said natural language 
query in such a way that said surface variants being equivalent to said natural language 
query with respect to lexical meaning of word tokens and surface syntactic roles of 
constituents"; 

But Julliard does not specifically teach "replacing word tokens ... of said natural 
language query." However, the examiner contends that this concept was well known in 
the art, as taught by Voorhees. 

In the same field of endeavor, Voorhees teaches the use of WordNet for 
information retrieval where a natural language query can be expanded using different 
words to express the same concept (p. 295, 1J2). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard by specifically providing the features 
as, as taught by Voorhees, since this technique allows an improved ability to express 
the relevant concept during search (p. 295, 1J2). 

In addition, Julliard teaches the ability to match the analyzed query to the text 
database (col. 1, line 56 through col. 2, line 1, Fig. 2 s5, col. 5, lines 6-43), which 
corresponds to "and means for comparing said surface variants and said analyzed 
natural language query with the indexed analyzed natural language text corpus in said 
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index"; the ability to pinpoint the matched expression in the original text (Fig. 2, s6, col. 
2, lines 31-44), which corresponds to "a result manager operatively connected to said 
index, for extracting, from said indexed and stored analyzed natural language text 
corpus, each portion of text comprising a string of word tokens that matches any one of 
said surface variants or said analyzed natural language query." 

Regarding claims 19 and 20, Julliard in view of Voorhees teaches everything 
claimed, as applied above (see claim 1); in addition, Julliard teaches that the steps, 
operations or manipulations described can be implemented by means of a software 
controlled processor with machine readable code (col. 3, lines 21-26), which 
corresponds to "(claim 19) [a] computer readable medium having computer-executable 
instructions for a general-purpose computer to perform the steps recited in claim I; and 
(claim 20) [a] computer program comprising computer-executable instructions for 
performing the steps recited in claim 1." 

2. Claims 6-12 and 18 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Julliard in view of Voorhees and Arampatzis et al. ("An Evaluation of Linguistically- 
motivated Indexing Schemes," Technical Report CSI-R9918, University of Nijmegen, 
Sept 1999), hereinafter referred to as Arampatzis-lndexing. 

Regarding claim 6, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 5); in addition, Julliard teaches the use of an indexing 
mechanism applied to the tagged text corpus, but Julliard does not specifically teach the 
"[(a)] providing, for each word token of said natural language text corpus with, a unique 
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word token location identifier; [(b)] storing information regarding the location of each 
word token of said natural language text corpus, based on said unique word token 
location identifiers; [(c)] storing, for each phrase type, information regarding the location 
of each phrase of this type in said natural language text corpus, based on said unique 
word token location identifiers; and [(d)] storing information regarding the location of 
each clause in said natural language text corpus, based on said unique word token 
location identifiers"; However, the examiner contends that this concept was well known 
in the art, as taught by Arampatzis-lndexing. 

Arampatizis-lndexing teaches linguistically-motivated indexing schemes based 
on part-of-speech tagging and shallow parsing (abstract) where Arampatzis-lndexing's 
representational choices include w ((a) (b), above), Lnj, Lnv ((c) (d), above) (an 
indexing for all word-forms, lemmatized nouns and adjectives (phrases), and nouns and 
verbs (clauses) found in the text. (§3). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees by specifically 
providing a unique word token location identifier, as taught by Arampatzis-lndexing, for 
the purpose of having access to a variety of individual representational choices. 

Regarding claim 7, Julliard in view of Voorhees and of Arampatzis-lndexing teach 
everything claimed, as applied above (see claim 6); in addition, Julliard teaches that the 
part-of-speech tagger is applied to the corpus (i.e., the type of each word is included in 
the tagged corpus) (col. 1, note (1)), which corresponds to "storing each word type of 
said natural language text corpus." But Julliard in view of Arampatzis-lndexing do not 
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specifically teach "storing, for each word token, its unique word token location identifier 
logically linked to the stored associated word type." However, the examiner contends 
that this concept was well known in the art, as taught by Arampatzis-lndexing. 

Arampatzis-lndexing further teaches the use of a word representation indexing 
set w to index all words in the corpus (§'s 2, 3, and 4). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees and Arampatzis- 
lndexing by specifically providing word token location indexing, as taught by 
Arampatzis-lndexing, for the purpose of accessing all the words in a corpus. 

Regarding claim 8, Julliard in view of Voorhees and Arampatzis-lndexing teach 
everything claimed, as applied above (see claim 7). But Julliard in view of Arampatzis- 
lndexing do not specifically teach that the location of phrases comprises the step of 
"[(a)] providing, for each phrase of said natural language text corpus, a unique phrase 
location identifier identifying the word tokens spanned by the phrase; [(b)] storing each 
phrase type of said natural language text corpus; and [(c)] storing, for each phrase, its 
unique phrase location identifier logically linked to the stored associated phrase type." 
However, the examiner contends that these step were well known in the art, as taught 
by Arampatzis-lndexing. 

Arampatzis-lndexing further teach the use of indexes (location identifiers) 
associated with Lnj (lemmatized nouns and adjectives) and Lnjv (lemmatized nouns, 
adjectives and verbs) (§'s 2, 3, and 4), which corresponds to (a), above; creating phrase 
frames that are associated with each semantically distinct phrase (i.e., type of phrase) 
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(Fig. 1, §2), which corresponds to (b), above; where each phrase has an index and a 
phrase frame (location and type) (§'s 2, 3), which corresponds to (c), above. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Arampatzis-lndexing by 
specifically providing steps indicated above, as taught by Arampatzis-lndexing, for the 
purpose of supporting an approach consistent with the Phrase Retrieval Hypothesis (§2, 
HD. 

Regarding claim 9, Julliard in view of Arampatzis-lndexing teach everything 
claimed, as applied above (see claim 7). But Julliard in view of Arampatzis-lndexing do 
not specifically teach that the location of phrases comprises the step of "[(a)] providing, 
for each clause of said natural language text corpus, a unique clause location identifier; 
[(b)] identifying the word tokens and phrases spanned by the clause; [(c)] storing, for 
each clause, its unique clause location identifier." However, the examiner contends that 
these steps were well known in the art, as taught by Arampatzis-lndexing. 

Arampatzis-lndexing further teach the use of indexes (location identifiers) 
associated with Lnjv (lemmatized nouns, adjectives and verbs) (§'s 2, 3, and 4), which 
corresponds to (a), above; creating phrase frames that are associated with each 
semantically distinct phrase (where the phrases can include nouns and verbs, i.e., 
clauses) (Fig. 1, §2), which corresponds to (b), above; where each phrase has an index 
and a phrase frame (location and type) (§'s 2, 3), which corresponds to (c), above. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees and Arampatzis- 
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Indexing by specifically providing the steps indicated above, as taught by Arampatzis- 
Indexing, for the purpose of supporting an approach consistent with the Phrase 
Retrieval Hypothesis (§2, 111). 

Regarding claims 10-12, Julliard in view of Voorhees and Arampatzis-lndexing 
teach everything claimed, as applied above (see claims 9, 10, and 11, respectively). In 
addition, Julliard teaches that indexing can be used to (uniquely) access files (or 
documents) and more accurate information like the position of the sentence in which 
those words were found (where the organizational construct between documents and 
sentences is paragraphs) (col. 5, lines 25-44), which corresponds to "locating 
documents (or paragraphs or sentences) in said natural language text corpus; 
providing, for each document (or paragraph or sentence) of said natural language text 
corpus, a unique document (or paragraph or sentence) location identifier identifying the 
word tokens, phrases, clauses (spanned by the sentence or ), sentences (spanned by 
the paragraph or) and paragraphs spanned by the document; storing, for each 
document (or paragraph or sentence), its unique document (or paragraph or sentence) 
location identifier 

Regarding claim 18, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 16). But Julliard does not specifically teach "wherein said 
index comprises multiple indexes based on a hierarchy of text units that are related by 
inclusion." However, the examiner contends that this concept was well known in the art, 
as taught by Arampatzis-lndexing. 
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Arampatzis-lndexing teaches linguistically-motivated indexing schemes using 
multiple indexing sets for different representational choices where a phrase can be 
recursively represented (i.e., in a hierarchy). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees by specifically 
providing the indexing and hierarchal representation, as taught by Arampatzis-lndexing, 
for the purpose of obtaining more detailed representations of both the query and the 
text. 

3. Claims 14, 15 and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Julliard in view of Voorhees and Arampatzis et al. ("Linguistically-motivated 
Information Retrieval"), hereinafter referred to as Arampatzis-Retrieval. 

Regarding claim 14, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1), but Julliard does not specifically teach "organizing the 
extracted information according to degree of correspondence with the query with 
respect to lexical meaning of word tokens and surface syntactic roles of constituents, 
such that a constituent in a portion of text having the same lemma as the equivalent 
constituent of the query is considered to have a higher degree of correspondence than 
a constituent in a portion of text being a synonym to the equivalent constituent of the 
query." However, the examiner contends that this concept was well known in the art, as 
taught by Arampatzis-Retrieval. 
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Arampatzis-Retrieval teaches techniques for linguistically motivated information 
retrieval where the similarity between a document and a query can be determined by 
the function, S(d,q), allowing the document collection to be ranked in response to a 
query (§6) where the processing steps include tokenization, part-of-speech tagging, 
morphological normalization (lemma determination), lexicosemantical normalization 
(semantic clustering or expansion) (§7). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees by specifically 
providing the weighting and matching approach, as taught by Arampatzis-Retrieval, for 
the purpose of choosing the best match to a query. 

Regarding claim 15, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1), but Julliard does not specifically teach "organizing the 
extracted information such that said portions of text are grouped according to sameness 
of grammatical subject, grammatical object, and lexical main verb." However, the 
examiner contends that this concept was well known in the art, as taught by 
Ara m patzi s-Retrie va I . 

Arampatzis-Retrieval teaches techniques for linguistically-motivated information 
retrieval which include weighing and matching the similarity between the document and 
the query. Julliard's approach includes the unnesting of a phrase frame where the main 
head carries the most semantic information of all the other elements in the frame and 
the frame includes grammatical information such as subject, object and verb (§'s 4-6). 
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Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees by specifically 
providing the features, as taught by Arampatzis-Retrieval, for the purpose of 
determining a better match between the query and the text. 

Regarding claim 17, Julliard in view of Voorhees teaches everything claimed, as 
applied above (see claim 1); in addition, Julliard teaches the processing of a user query 
putting it into tagged form, applying grammar rules and then using mapping or indexing 
to locate a match within a text corpus (Fig. 2, col. 1, line 45 through col. 2, line 1, col. 5, 
lines 6-43), which corresponds to "wherein a string of word tokens in said indexed and 
stored analyzed natural language text corpus matches one of said surface variants." 
But Julliard does not specifically teach "or said analyzed natural language query if it 
comprises the head words of phrases bearing the grammatical relations of subject, 
object, and lexical main verb in said one of said surface variants or said analyzed 
natural language query in the same linear order as in said one of said surface variants 
or said analyzed natural language query." However, the examiner contends that this 
concept was well known in the art, as taught by Arampatzis-Retrieval. 

Arampatzis-Retrieval teaches techniques for linguistically-motivated information 
retrieval including weighing and matching the similarity between the document and the 
query. Arampatzis-Retrievars approach includes the unnesting of a phrase frame where 
the main head carries the most semantic information of all the other elements in the 
frame and the frame includes grammatical information such as subject, object and verb 



(§'s 4-< 
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Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Julliard in view of Voorhees by specifically 
providing the features, as taught by Arampatzis-Retrieval, for the purpose of 
determining a better match between the query and the text. 



Allowable Subject Matter 

4. Claims 21-28 are allowed. 

It is noted that the closest prior art of record, Julliard discloses a linguistic search 
method, but Julliard does not teach the linking of the stored word token location 
identifiers to the stored word types, such that, for a given identified word token, the 
stored word token location identifier identifying the location of the identified word token 
is logically, linked to the stored word type associated with the identified word token. 
Thus, independent claims 21 and 26 are allowable over the prior art of record because 
the cited prior art alone or in combination, does not fairly suggest or disclose the 
claimed combination of features. 

Response to Arguments 

5. Applicant's arguments with respect to claims 1-20 have been considered 
but are moot in view of the new ground(s) of rejection. 
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Conclusion 



Any response to this office action should be mailed to: 

Commissioner of Patents and Trademarks 
P.O. Box 1450 
Alexandria, VA 22313-1450 

or faxed to: 

(703) 872-9314 

Hand-delivered responses should be brought to: 

Crystal Park II 

2121 Crystal Drive 

Arlington, VA. 

Sixth Floor (Receptionist) 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Dr. V. Paul Harper whose telephone number is (703) 
305-4197. The examiner can normally be reached on Monday through Friday from 8:00 
a.m. to 4:30 p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil, can be reached on (703) 305-9645. The fax phone 
number for the Technology Center 2600 is (703) 872-9314. 

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to the Technology Center 2600 Customer Service office 
whose telephone number is (703) 306-0377. 



VPH/vph L 
December 5, 2003 




